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SYSTEM AND METHOD FOR MANAGING LARGE DATA SETS 

CROSS-REFERENCE TO RELATED APPLICATIONS 
[0001] None. 

STATEMENT REGARDING FEDERALLY SPONSORED RESEARCH OR 

DEVELOPMENT 

[0002] None. 

TECHNICAL FIELD 

[0003] Embodiments of the present invention relate to management of large data sets 

within a medical environment. More particularly, embodiments of the invention are 
directed to facilitating management of image data sets to provide immediate access to the 
image data sets within an automated medical environment. 

BACKGROUND OF THE INVENTION 

[0004] Currently, imaging data is captured at an image capturing location during a 

patient examination. Typically, a number of data sets or studies are created by the image 
acquisition device. These studies will later be reviewed on demand at a review station, 
likely having a location that is not yet determined. Accordingly, the captured data is 
stored in a central server location such as a picture archiving conununication system 
(PACS) until it is requested by a reviewer. 

[0005] In the field of medical imaging, modem techniques have created large data sets 

that can be cumbersome to transfer and store. Specialists including cardiologists need 
motion pictures at thirty or more frames per second. Each frame could be several 
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megabytes or more in size. X-ray Angiography studies can average around one gigabyte 
or more. 

[0006] When a physician is ready to review the studies, the physician will go to a review 

station to read and analyze the studies. Physicians may require that the study be available 
for reading inmiediately upon demand. Because of the large size of the data sets and the 
limited speed of hospital networks, simply retrieving the studies on demand may yield 
unacceptably long waiting times for physicians. 

[0007] Several solutions have been proposed to the aforementioned problem. One 

solution is for the central data server to auto re-route the collected studies to all 
diagnostic stations. While this solution makes the images inunediately available to 
physicians, it causes excessive network traffic and consumes a large amount of disk 
space. 

[0008] Another solution created to solve this problem has been to automatically route the 

studies to a small set of review stations based on predictive algorithms. This solution 
also has drawbacks. For instance, if a physician selects a review station that was not 
predicted by the predictive algorithm, the physician may be required to wait an excessive 
amount of time for all of the studies to download. Alternatively, the physician may be 
forced to find a review station where the predictive algorithm routed the studies. 
Additionally, those stations selected by the predictive algorithm lose valuable storage 
space. 

[0009] Accordingly, a solution is needed for making the large data sets available to the 

physician that does not require predicting physician location. A solution is also needed 
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that does not auto-route all data sets to all review stations, thus creating excessive 
network traffic and consuming excessive memory resources. . 

BRffiF SUMMARY OF THE INVENTION 
[0010] In one aspect, the present invention is directed to a method for managing large 

data sets (or studies) transferred from at least one acquisition device to a study process 
server in order to transfer the data sets to at least one review station. The method 
includes sorting each received study into appropriate working sets, selecting a subset of 
studies from at least one working set, and distributing the selected subset of studies to at 
least one review station. 

[0011] In an additional aspect of the invention, a method is provided for managing large 

data sets transferred from at least one acquisition device to a study process server in order 
to transfer the data sets to at least one review station. The method includes transferring a 
selected subset of the existing data sets from the study process server to each review 
station and monitoring each review station for selected user activities, for example a 
login. The method additionally includes populating the review station with studies from 
a relevant working set upon detecting the selected user activities. 

[0012] In an additional aspect of the invention, a system is provided for managing large 

data sets transferred from at least one acquisition device to a study process server in order 
to transfer the data sets to at least one review station. The system includes a study sorting 
module for sorting each study received by the study process server into an appropriate 
working set and a study control module for selecting a subset of studies from at least one 
working set. The system additionally includes a study distribution module for 
distributing the selected subset of studies to each review station. 
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[0013] In yet an additional aspect of the invention, a system is provided for managing 

large data sets transferred from at least one acquisition device to a study process server in 
order to transfer the data sets to at least one review station. The system includes a study 
distribution module for transferring a selected subset of the existing data sets to at least 
one review station and a study control module for monitoring each review station for 
selected user activities. The study distribution module populates the review station with 
studies from at least one relevant working set upon detection of the user activities by the 
study control module. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0014] The present invention is described in detail below with reference to the attached 

drawings figures, wherein: 
[0015] FIG. 1 is a block diagram illustrating components of a system for managing large 

data sets in accordance with an embodiment of the invention; 
[0016] FIG. 2 is a block diagram illustrating components of a central control server in 

accordance with an embodiment of the invention; 
[0017] FIG. 3 is a block diagram illustrating components of a study process server in 

accordance with an embodiment of the invention; 
[0018] FIG. 4 is a block diagram illustrating components of a working set associated with 

a central control server in accordance with an embodiment of the invention; 
[0019] FIG. 5 is a flow chart illustrating a method for transferring data to a review station 

in accordance with an embodiment of the invention; and 
[0020] FIG. 6 is a flow chart illustrating further aspects of a method for transferring data 

to review stations in accordance with an embodiment of the invention. 
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DETAILED DESCRIPTION OF THE INVENTION 
[0021] Embodiments of the present invention are directed to a system and method for 

managing large data sets to maximize efficiency. In particular, the system manages 
transfer of large data sets to review stations for physician viewing. Techniques of this 
invention optimize network and disk space utilization while satisfying the need for 
cardiology images on demand. 
[0022] As will be further explained in conjunction with the FIGs. below, the system of 

the invention may prime all work review stations with a small set of studies, so that when 
a physician does log in, he can begin reading while the system then begins to retrieve 
more studies to be read. 

[0023] FIG. 1 illustrates a system for managing large data sets in accordance with an 

embodiment of the invention. Multiple acquisition devices 10 capture data and transmit 
the data over a network to a central control server 30. The central control server 30 
preferably includes a study process server 60 and stored working sets 40. The working 
sets 40 are preferably stored in a database 150 as described below with respect to FIG. 2. 
The study process server 60 controls the distribution of the data stored in the working sets 
40 over a network to a plurality of review stations 20, where physicians will be able to 
review the studies. The review stations may be diagnostic or non-diagnostic. The study 
process server 60 may prime each review station 20 with a number of studies so that a 
physician will have studies available for review on demand. 

[0024] The study process server 60 creates the subset of distributed studies in any one of 

a number of ways that will be further explained below. By priming the review stations 
20, the system of the invention gives the physician a starting set, while additional studies 
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load in the background. When a physician logs in at any given review station 20, the 
study process server 60 knows where the reading and documentation will be done. As 
the review station 20 is already primed with a set of studies to review, the physician has 
studies on demand to begin working. 
[0025] To accomplish the aforementioned goal, the central control server 30 tracks the 

studies that need to be reviewed in a working set. As explained further below, 
components of the central control server 30 then distribute studies from the working set 
across all review stations 20 so that each station has some studies from the working set. 
The system of the invention can customize the degree of replication of studies across 
different review stations and the fraction of the working set distributed can be customized 
to the capacity of the underlying network and storage systems as well as the typical work 
pattems of the readers. 

[0026] FIG. 2 illustrates an embodiment of the components of the central control server 

30. The central control server 30 may include a processing unit 122, a peripheral 
interface 124, a user interface 126, and a network interface 128. The central control 
server 30 may also include a memory 130. A system bus 129 couples the aforementioned 
components. The central control server 30 may also include a database 150. 

[0027] The system memory 130 may include computer storage media in the form of 

volatile and/or nonvolatile memory such as read only memory (ROM) 132 and random 
access memory (RAM) 140. A basic input/output system 134 (BIOS), containing the 
basic routines that help to transfer information between elements within the central 
control server 30, such as during start-up, is typically stored in ROM 132. RAM 140 
typically contains data and/or program modules that are immediately accessible to and/or 
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presently being operated on by processing unit 122. For instance, the study process 
server 60 may be contained within the RAM 140 of the system memory 130. 

[0028] By way of example, and not limitation, FIG. 2 illustrates operating system 142, 

apphcation programs 144, other program modules 146, and program data 148. The 
application programs 144 and other programs 146 may be described in the general 
context of computer-executable instructions, such as program modules, being executed 
by a computer. Generally, program modules include routines, programs, objects, 
components, data structures, etc. that perform particular tasks or implement particular 
abstract data types. Moreover, those skilled in the art will appreciate that the invention 
may be practiced with other computer system configurations, including hand-held 
devices, multiprocessor systems, microprocessor-based or progranmiable consumer 
electronics, minicomputers, mainframe computers, and the like. 

[0029] The central control server 30 may also include other removable/non-removable, 

volatile/nonvolatile computer storage media. A hard disk drive may be provided that 
reads from or writes to non-removable, nonvolatile magnetic media, a magnetic disk 
drive that reads from or writes to a removable, nonvolatile magnetic disk, and an optical 
disk drive that reads from or writes to a removable, nonvolatile optical disk such as a CD 
ROM or other optical media. Other removable/non-removable, volatile/nonvolatile 
computer storage media that can be used in the exemplary operating environment 
include, but are not limited to, magnetic tape cassettes, flash memory cards, digital 
versatile disks, digital video tape, solid state RAM, solid state ROM, and the Uke. The 
hard disk drive is typically connected to the system bus through a non-removable 
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memory interface and magnetic disk drive and optical disk drive are typically connected 
to the system bus by a removable memory interface. 

[0030] A user may enter conmiands and information into the central information system 

through the user interface 126 using input devices such as a keyboard and pointing 
device, commonly referred to as a mouse, trackball or touch pad. Other input devices 
may include a microphone, satellite dish, scanner, or the like. These and other input 
devices are often connected to the processing unit 122 through a user input interface 126 
that is coupled to the system bus 129, but may be connected by other interface and bus 
structures, such as a parallel port or a universal serial bus (USB). A monitor or other type 
of display device may also be connected to the system bus 129 via an interface, such as 
the peripheral interface 124. In addition to the monitor, computers may also include 
other peripheral output devices such as speakers and printer. 

[0031] The illustrated server system 30 is merely an example of a suitable environment 

for the system of the invention and is not intended to suggest any limitation as to the 
scope of use or functionality of the invention. Neither should the server system 30 be 
interpreted as having any dependency or requirement relating to any one or combination 
of components illustrated. 

[0032] The server system 30 in the present invention may operate in a networked 

environment using logical connections to communicate with the components shown in 
FIG. 2. Logical connections for networking may include a local area network (LAN) or a 
wide area network (WAN), but may also include other networks. When used in a LAN 
networking environment, the server system 30 may be connected to the LAN through the 
network interface 128 or adapter. When used in a WAN networking environment, the 
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server system 30 typically includes a modem or other means for establishing 
communications, such as the Internet. The modem, which may be internal or external, 
may be connected to the system bus 129 via the user input interface 126, or other 
appropriate mechanism. 

[0033] FIG. 3 illustrates an embodiment of the components of the study process server 

60. The study process server 60 may include a study sorting module 62, a study 
distribution module 64, and a study control module 66. The study sorting module 62 may 
receive the studies from the imaging stations 10 and sort them into appropriate working 
sets 40. The study distribution module 64 may distribute studies to the review stations 
20. The study control module 66 may determine which studies should be grouped 
together for distribution and additionally may control further study processing. 

[0034] FIG. 4 illustrates storage of the working sets 40. The working sets may include 

for example an XA (x-ray angiography) working set 42, an MR (magnetic resonance) 
working set 44, and a US (ultrasound) working set 50. Other examples include CR 
(computed radiography), CT (computed tomography), DR (digital radiography), NM 
(nuclear medicine) working sets. Each working set may include a plurality of studies. 
Each study may be associated with a patient name 52, a type of study 54, a unique study 
ID 55, and an indication of the study status 56. More generally, studies may be sorted 
into a working set based on the application of a number of parameters against any of a 
number of known characteristics for each study in addition to study type, status and 
patient name. By way of example and not limitation, the size of the study, facility at 
which the study was taken and time of the study may be evaluated to sort the study into 
one or more working sets. Studies that have been completed may ultimately be removed 
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from a working set. A working set may be defined as a set of studies that has not been 
"completed". In order for a study to be completed, the system may require that one or 
more concerned parties have reviewed and analyzed the study. 

[0035] In the illustrated embodiment, the working sets 40 are stored on the study process 

server 60. However, the working sets 40 may be stored at an alternative location and 
merely accessed in accordance with instructions stored on the study process server 60. 
The study ID 55 may include a distinct number for identification of each study. In an 
embodiment of the invention, the study ID is a 64 byte number. Furthermore, some 
studies may be stored in more than one working set. For example, a physician working 
set may be created that includes studies that are also included in the working set of 
another physician who requires access to the studies. 

[0036] For the particular central control server 30 and set of reviewers, there may be 

multiple working sets and different reading roles all being processed with parameters and 
preferences set according to the workflow properties of the studies in that particular 
working set. For example, the reading roles and patterns of US (echo) studies may be 
different from those of XA studies. Accordingly, these studies would be stored in 
different working sets and the study control module 66 would use different rules to 
handle the different working sets. 

[0037] In operation, imaging occurs at the acquisition devices 10 after an order has been 

created for a patient. When a patient arrives for an appointment, the patient is directed to 
an appropriate acquisition device 10 and the imaging process is performed. The 
subsequent steps are described below with reference to FIGS. 5 and 6. 
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[0038] FIG. 5 is a flow chart illustrating a method in accordance with an embodiment of 

the invention. In step A 10, the central control server 30 or other storage device receives 
one or more studies from the acquisition devices 10. It is possible to store the studies in 
an alternative device and process the studies using the instructions stored on the study 
process server 60. In step A20, the study sorting module 62 sorts each received study to 
one of the working sets 40. The central control server 30 waits for additional studies 
from the imaging stations 10 in step A30. If more studies are available in step A40, the 
central control server 30 or other storage device receives and sorts the studies. In step 
A50, the study control module 66 creates a subset of studies appropriate for distribution 
to each review station 20. In step A60, the study distribution module 64 distributes the 
created subset to the review stations 20. 

[0039] The determination of the appropriate subset of studies for distribution in step A50 

may be accomplished in a number of ways. In its most basic form, distribution or 
priming is accomplished "blindly" in that each review station 20 will receive the same 
subset of the entire set of studies received by the central control server 30. Typically, this 
subset will include the most recently acquired studies. The use of a subset rather than the 
entire set of studies received by the central control server 30 prevents excessive network 
traffic and excessive consumption of storage space. 

[0040] Alternatively, the subset may be selected based on an algorithm within the study 

control module 66. The algorithm may be either a predictive or a non-predictive 
algorithm. A non-predictive algorithm would intelligently select a subset of data based 
on the central control server's working sets, procedure types, and schedules. For 
instance, a non-predictive algorithm might take a fixed number of studies from each 
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working set, from each procedure type, or assigned to each physician. Alternatively, 
predictive algorithms may be used by the study control module 66. Predictive algorithms 
can determine appropriate review stations for distribution based on the physician 
schedule and defined working locations. This information may be stored in the central 
control server 30. The predictive algorithms may cause the distribution module 64 to 
distribute a larger subset of received studies to specifically predicted locations. 

[0041] Although step A50 is directed to selecting a single subset, it should be understood 

that the study process server may select multiple subsets in step A50. Depending on the 
particular embodiment of the invention employed, different subsets may be directed to 
different review stations or all of the review stations may receive the same subsets. 
Furthermore, the selected subset or subsets may be selected from a single working set or 
from multiple working sets. 

[0042] As illustrated in FIG. 6, when a physician needs to access studies, he logs into a 

review station 20. Upon receiving a login (from a physician or other medical 
professional) in step B02, the study process server 60 populates the review station 20 
with an appropriate data set for the physician in step B04. Once the study control module 
66 knows from which review station 20 a physician is working, the study control module 
66 can then begin filling that review station's buffer with more studies to view. The size 
of a particular buffer on a review station may be determined by the study control module 
66 based on various factors including the presence of a reader for this type of study at this 
or other review stations, the speed at which the readers are progressing, and the 
congestion of the network. 
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[0043] In step B06, the study process server 60 continues to monitor activity at the 

review stations 20. In step BOS, if a review station 20 informs the study process server 
60 that a study has been completed, the study process server 60 informs other review 
stations 20 that the study has been completed in step BIO so that the other review stations 
20 may delete the completed study in order to create space for additional studies. 
Furthermore, the study process server 60 may monitor the review stations 20 for a low 
buffer threshold. If a number of studies has been completed and deleted, the study 
process server 60 may re-populate the review station 20 in step B14. 

[0044] As set forth above, a reader completes the work on a particular study, the study 

control module 66 is notified and may remove the study from the working set. 
Additionally, if the study completed on one review station also resides on another review 
station, the study control module 66 sends a message to that review station instructing it 
to delete that study in order to create space for another study. If the deletion of that study 
results in the buffer being too small, one or more replacement studies are automatically 
forwarded. 

[0045] The review stations 20 may have pre-set "watermarks" or buffer thresholds. The 

review stations 20 preferably have a low watermark. If the low watermark is reached, the 
study process server 60 re-populates the review station 20 as described above. The 
watermark may be dynamic as it may grow and shrink based on the rate at which a 
physician reads studies on a particular diagnostic station. Altematively, the watermark 
may be configurable. A configurable watermark could be pre-set for optimization based 
on the physician, procedure, network bandwidth, disk space, etc. 
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[0046] As described above, the system provides a technique for displaying large image 

data sets required to be displayed on demand in a distributed environment. A distinct 
advantage of the system is its reduction in network traffic and excessive disk space 
requirements while preventing imaging reading delays. The above-described system 
additionally reduces the need to upgrade network and disk infrastructures. 

[0047] While particular embodiments of the invention have been illustrated and 

described in detail herein, it should be understood that various changes and modifications 
might be made to the invention without departing from the scope and intent of the 
invention. For instance, while the invention has generally been described in terms of a 
central server at which the studies are stored and subsets of the working sets are 
distributed, in embodiments the studies may be stored at one or more locations distinct 
from the central server. In these embodiments, the central server would direct the 
distribution of the studies in accordance with the invention from the distinct storage 
locations to the viewing stations. For example, a particular department may have its own 
PACS system at which studies are stored rather than transferring the studies to a central 
server. Instead, the departmental PACs would conununicate information about the 
studies to the central server required to determine which studies should be distribute to 
which viewing stations and provide such instructions to the departmental PACS. The 
embodiments described herein are intended in all respects to be illustrative rather than 
restrictive. Alternate embodiments will become apparent to those skilled in the art to 
which the present invention pertains without departing from its scope. 

[0048] From the foregoing it will be seen that this invention is one well adapted to attain 

all the ends and objects set forth above, together with other advantages, which are 
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obvious and inherent to the system and method. It will be understood that certain features 
and sub-combinations are of utility and may be employed without reference to other 
features and sub-combinations. This is contemplated and within the scope of die 
appended claims. 



-15- 



